
Conversation

@SkafteNicki (Collaborator) commented Aug 27, 2025

What does this PR do?

Fixes #21069

Before submitting
  • Was this discussed/agreed via a GitHub issue? (not for typos and docs)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure your PR does only one thing, instead of bundling different changes together?
  • Did you make sure to update the documentation with your changes? (if necessary)
  • Did you write any new necessary tests? (not for typos and docs)
  • Did you verify new and existing tests pass locally with your changes?
  • Did you list all the breaking changes introduced by this pull request?
  • Did you update the CHANGELOG? (not for typos, docs, test updates, or minor internal changes/refactors)

PR review

Anyone in the community is welcome to review the PR.
Before you start reviewing, make sure you have read the review guidelines. In short, see the following bullet-list:

Reviewer checklist
  • Is this pull request ready for review? (if not, please submit in draft mode)
  • Check that all items from Before submitting are resolved
  • Make sure the title is self-explanatory and the description concisely explains the PR
  • Add labels and milestones (and optionally projects) to the PR so it can be classified

📚 Documentation preview 📚: https://pytorch-lightning--21120.org.readthedocs.build/en/21120/

@SkafteNicki self-assigned this Aug 27, 2025
@SkafteNicki added the docs (Documentation related) label Aug 27, 2025
@github-actions bot added the pl (generic label for the PyTorch Lightning package) label Aug 27, 2025
Comment on lines +72 to +75
trainer.fit()
├── setup(stage="fit")
│   └── [Callbacks only]
@GdoongMathew (Contributor) commented:

Just a suggestion: perhaps adding this could also be helpful, as it also shows one of the known issues, #19658.

    ├── setup(stage="fit")
    │   ├── [Callback.setup]
    │   ├── [LightningModule.configure_sharded_model]
    │   ├── [LightningModule.configure_model]
    │   └── [Strategy.setup]
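A minimal, runnable sketch of how this order can be checked locally. It assumes the lightning 2.x package layout with the configure_model hook; OrderCallback and OrderModel are made-up names, and only the Callback- and LightningModule-level hooks from the suggestion are instrumented:

    import torch
    from torch.utils.data import DataLoader, TensorDataset
    import lightning as L


    class OrderCallback(L.Callback):
        def setup(self, trainer, pl_module, stage):
            print(f"Callback.setup(stage={stage!r})")


    class OrderModel(L.LightningModule):
        def __init__(self):
            super().__init__()
            self.layer = torch.nn.Linear(4, 1)

        def configure_model(self):
            print("LightningModule.configure_model")

        def setup(self, stage):
            print(f"LightningModule.setup(stage={stage!r})")

        def training_step(self, batch, batch_idx):
            x, y = batch
            return torch.nn.functional.mse_loss(self.layer(x), y)

        def configure_optimizers(self):
            return torch.optim.SGD(self.parameters(), lr=0.1)


    if __name__ == "__main__":
        data = DataLoader(TensorDataset(torch.randn(8, 4), torch.randn(8, 1)), batch_size=4)
        trainer = L.Trainer(max_epochs=1, limit_train_batches=1, logger=False,
                            enable_checkpointing=False, enable_progress_bar=False,
                            callbacks=[OrderCallback()])
        trainer.fit(OrderModel(), data)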

Comment on lines +80 to +82
│   └── [Strategy]
├── on_sanity_check_start()
@GdoongMathew (Contributor) commented Aug 27, 2025:

Perhaps we could also add when the checkpoint and state_dict are loaded?

on_fit_start()

if strategy.restore_checkpoint_after_setup:
    [LightningModule.on_load_checkpoint]
    [Strategy.load_model_state_dict]

[optimizer.load_state_dict]
[lr_scheduler.load_state_dict]

on_sanity_check_start()
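A hedged, runnable sketch of how the restore point can be observed when resuming with ckpt_path. The file name resume_demo.ckpt and the class names are invented, and only LightningModule-level hooks are instrumented (not the Strategy or the optimizer/scheduler restore):

    import torch
    from torch.utils.data import DataLoader, TensorDataset
    import lightning as L


    class ResumeModel(L.LightningModule):
        def __init__(self):
            super().__init__()
            self.layer = torch.nn.Linear(4, 1)

        def on_fit_start(self):
            print("LightningModule.on_fit_start")

        def on_load_checkpoint(self, checkpoint):
            print("LightningModule.on_load_checkpoint")

        def training_step(self, batch, batch_idx):
            x, y = batch
            return torch.nn.functional.mse_loss(self.layer(x), y)

        def configure_optimizers(self):
            return torch.optim.SGD(self.parameters(), lr=0.1)


    def make_trainer():
        return L.Trainer(max_epochs=1, limit_train_batches=1, logger=False,
                         enable_checkpointing=False, enable_progress_bar=False)


    if __name__ == "__main__":
        data = DataLoader(TensorDataset(torch.randn(8, 4), torch.randn(8, 1)), batch_size=4)
        first = make_trainer()
        first.fit(ResumeModel(), data)             # plain run, nothing to restore
        first.save_checkpoint("resume_demo.ckpt")  # produce a checkpoint to resume from
        # Resuming: compare where on_load_checkpoint prints relative to on_fit_start;
        # as noted above, this depends on strategy.restore_checkpoint_after_setup.
        make_trainer().fit(ResumeModel(), data, ckpt_path="resume_demo.ckpt")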

Comment on lines +209 to +210
└── teardown(stage="fit")
    └── [Callbacks only]
@GdoongMathew (Contributor) commented:

I believe that the LightningModule also has a teardown hook here.

    └── teardown(stage="fit")
        ├── [Callback]
        └── [LightningModule]

Comment on lines +204 to +209
├── on_fit_end()
│   ├── [Callbacks]
│   ├── [LightningModule]
│   └── [Strategy]
└── teardown(stage="fit")
@GdoongMathew (Contributor) commented:

From my personal code inspection, I believe that strategy.teardown should be called before the on_fit_end stage, and also that the Strategy class does not have an on_fit_end hook.

strategy.teardown()
    └── LightningModule.cpu()
on_fit_end()
    ├── Callback.on_fit_end()
    └── LightningModule.on_fit_end()
Callback.teardown()
LightningModule.teardown()
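A small sketch (class names invented) that instruments only the Callback and LightningModule hooks, not Strategy.teardown, so the relative order of on_fit_end and teardown(stage="fit") at the end of trainer.fit() can be checked against the diagram:

    import torch
    from torch.utils.data import DataLoader, TensorDataset
    import lightning as L


    class EndOrderCallback(L.Callback):
        def on_fit_end(self, trainer, pl_module):
            print("Callback.on_fit_end")

        def teardown(self, trainer, pl_module, stage):
            print(f"Callback.teardown(stage={stage!r})")


    class EndOrderModel(L.LightningModule):
        def __init__(self):
            super().__init__()
            self.layer = torch.nn.Linear(4, 1)

        def on_fit_end(self):
            print("LightningModule.on_fit_end")

        def teardown(self, stage):
            print(f"LightningModule.teardown(stage={stage!r})")

        def training_step(self, batch, batch_idx):
            x, y = batch
            return torch.nn.functional.mse_loss(self.layer(x), y)

        def configure_optimizers(self):
            return torch.optim.SGD(self.parameters(), lr=0.1)


    if __name__ == "__main__":
        data = DataLoader(TensorDataset(torch.randn(8, 4), torch.randn(8, 1)), batch_size=4)
        trainer = L.Trainer(max_epochs=1, limit_train_batches=1, logger=False,
                            enable_checkpointing=False, enable_progress_bar=False,
                            callbacks=[EndOrderCallback()])
        trainer.fit(EndOrderModel(), data)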

Comment on lines +135 to +140
│   │   ├── on_before_zero_grad()
│   │   │   ├── [Callbacks]
│   │   │   └── [LightningModule]
│   │   │
│   │   ├── [Forward Pass - training_step()]
│   │   │   └── [Strategy only]
@GdoongMathew (Contributor) commented:

I think training_step is called before on_before_zero_grad?

strategy.training_step()
    └── LightningModule.training_step()
on_before_zero_grad()
    ├── Callback.on_before_zero_grad()
    ├── LightningModule.on_before_zero_grad()
    └── LightningModule.optimizer_zero_grad()
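For reference, a runnable sketch (StepOrderModel is a hypothetical name; only module-level hooks are instrumented) that makes the relative order of training_step, on_before_zero_grad, the backward hooks, and on_before_optimizer_step visible:

    import torch
    from torch.utils.data import DataLoader, TensorDataset
    import lightning as L


    class StepOrderModel(L.LightningModule):
        def __init__(self):
            super().__init__()
            self.layer = torch.nn.Linear(4, 1)

        def training_step(self, batch, batch_idx):
            print("LightningModule.training_step")
            x, y = batch
            return torch.nn.functional.mse_loss(self.layer(x), y)

        def on_before_zero_grad(self, optimizer):
            print("LightningModule.on_before_zero_grad")

        def on_before_backward(self, loss):
            print("LightningModule.on_before_backward")

        def on_after_backward(self):
            print("LightningModule.on_after_backward")

        def on_before_optimizer_step(self, optimizer):
            print("LightningModule.on_before_optimizer_step")

        def configure_optimizers(self):
            return torch.optim.SGD(self.parameters(), lr=0.1)


    if __name__ == "__main__":
        data = DataLoader(TensorDataset(torch.randn(8, 4), torch.randn(8, 1)), batch_size=4)
        trainer = L.Trainer(max_epochs=1, limit_train_batches=2, logger=False,
                            enable_checkpointing=False, enable_progress_bar=False)
        trainer.fit(StepOrderModel(), data)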

Comment on lines +153 to +155
│   │   ├── on_before_optimizer_step()
│   │   │   ├── [Callbacks]
│   │   │   └── [LightningModule]
@GdoongMathew (Contributor) commented:

Perhaps also add configure_gradient_clipping here?

    │   │   ├── on_before_optimizer_step()
    │   │   │   ├── [Callbacks]
    │   │   │   ├── [LightningModule]
    │   │   │   └── [LightningModule.configure_gradient_clipping]
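For context, a hedged sketch of what overriding that hook looks like. ClippedModule is a made-up name, the clip value 0.5 is arbitrary, and self.clip_gradients is the built-in helper it delegates to:

    import lightning as L


    class ClippedModule(L.LightningModule):
        def configure_gradient_clipping(self, optimizer, gradient_clip_val=None,
                                        gradient_clip_algorithm=None):
            # Runs as part of the optimizer-step phase shown in the tree above.
            print("LightningModule.configure_gradient_clipping")
            # Delegate to the default clipping helper with a custom value.
            self.clip_gradients(optimizer, gradient_clip_val=0.5,
                                gradient_clip_algorithm="norm")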

Comment on lines +193 to +195
│   ├── [Callbacks - Non-monitoring only]
│   ├── [LightningModule]
│   └── [Callbacks - Monitoring only]
@GdoongMathew (Contributor) commented:

Perhaps Non-monitoring could be changed to Checkpoint & EarlyStopping to be more specific.

@Borda (Collaborator) commented Aug 27, 2025:

@GdoongMathew, all good calls. How about landing this as a baseline, and you can send another PR with your suggestions for improvement? :)

@GdoongMathew (Contributor) commented:

From my previous experience, since Lightning uses a closure function in the optimizer, tracing the optimizer step can be a bit difficult. Maybe we could also highlight that here?

            # closure function for optimizer
            strategy.training_step()
            └── LightningModule.training_step()

            callback.on_before_zero_grad()
            LightningModule.on_before_zero_grad()
            LightningModule.optimizer_zero_grad()
            strategy.backward()
            ├── callback.on_before_backward()
            ├── LightningModule.on_before_backward()
            ├── LightningModule.backward()
            │   └── loss.backward()
            ├── callback.on_after_backward()
            └── LightningModule.on_after_backward()

            callback.on_before_optimizer_step()
            LightningModule.on_before_optimizer_step()
            LightningModule.configure_gradient_clipping()
    
            optimizer.step(closure)

But this also makes my personal notes a bit messier as well...
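To illustrate the closure point without any Lightning machinery, here is a bare-PyTorch sketch (the tensors and the LBFGS choice are arbitrary) of why the zero-grad/backward work appears "inside" optimizer.step() when tracing automatic optimization:

    import torch

    model = torch.nn.Linear(4, 1)
    optimizer = torch.optim.LBFGS(model.parameters(), lr=0.1)
    x, y = torch.randn(8, 4), torch.randn(8, 1)


    def closure():
        # Roughly the work Lightning bundles into its training-step closure:
        loss = torch.nn.functional.mse_loss(model(x), y)  # ~ training_step
        optimizer.zero_grad()                             # ~ optimizer_zero_grad
        loss.backward()                                   # ~ strategy.backward
        return loss


    # The optimizer may invoke the closure one or more times inside step(),
    # which is why the hooks above appear nested under optimizer.step(closure).
    optimizer.step(closure)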

@GdoongMathew (Contributor) commented:

@Borda sure!! will open another PR for those suggestions. thanks~

@Borda (comment marked as off-topic)

@Borda requested a review from GdoongMathew on September 1, 2025, 11:32
@Borda merged commit 3d56296 into Lightning-AI:master on Sep 1, 2025
24 checks passed
@Borda (Collaborator) commented Sep 1, 2025:

@GdoongMathew, it has landed, so now you can send your suggestions with improvements 🚀

Borda added a commit that referenced this pull request Sep 3, 2025
* add hook order
* add to index
* Apply suggestions from code review
* BoringModel
* testoutput

---------

Co-authored-by: Jirka Borovec <[email protected]>
Co-authored-by: Jirka B <[email protected]>
(cherry picked from commit 3d56296)
lantiga pushed a commit that referenced this pull request Sep 5, 2025
* add hook order
* add to index
* Apply suggestions from code review
* BoringModel
* testoutput

---------

Co-authored-by: Jirka Borovec <[email protected]>
Co-authored-by: Jirka B <[email protected]>
(cherry picked from commit 3d56296)